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Abstract 

Empirical analysis show that, after the update of a browser, the publication of the vulnerability 
of a software, or the discovery of a cyber worm, the fraction of computers still using the older 
version, or being not yet patched, or exhibiting worm activity decays as power laws ~ l/t a with 
< a < 1 over time scales of years. We present a simple model for this persistence phenomenon 
framed within the standard priority queuing theory, of a target task which has the lowest priority 
compared with all other tasks that flow on the computer of an individual. We identify a "time 
deficit" control parameter [5 and a bifurcation to a regime where there is a non-zero probability for 
the target task to never be completed. The distribution of waiting time T till the completion of the 
target task has the power law tail ~ 1/i 1 / 2 , resulting from a first-passage solution of an equivalent 
Wiener process. Taking into account a diversity of time deficit parameters in a population of 
individuals, the power law tail is changed into l/t a with a £ (0.5, oo), including the well-known 
case 1/t. We also study the effect of "procrastination", defined as the situation in which the target 
task may be postponed or delayed even after the individual has solved all other pending tasks. 
This new regime provides an explanation for even slower apparent decay and longer persistence. 
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I. INTRODUCTION 



There is growing evidence that remarkably robust quantitative and sometimes universal 
laws describe the behavior of humans in society. Consider a typical individual, who is 
subjected to a flow of information and of requested tasks, in the presence of time, energy, 
regulatory, social and monetary constraints. Such an individual will respond by a sequence 
of decisions and actions, which themselves contribute to the flow of influences spreading to 
other individuals. A recently developed approach to unravel at least a part of this complex 
ballet consists in quantifying the waiting time distribution between triggering factor and 
response performed by humans, which has been found in many situations to be a power 
law pdf(t) ~ l/t p with an exponent p less than 2. As a consequence, the mathematical 
expectation of the waiting time between consecutive events is infinite, which embodies the 
notion of a very long persistence of past influences. This power law behavior has been 
documented quantitatively for the distribution of waiting times until an email message is 
answered jl], for the time intervals between consecutive e-mails sent by a single user and 
time delays for e-mail replies for the waiting time between receipt and response in the 
correspondence of Darwin and of Einstein [sj], and for the waiting times associated with other 
human activity patterns which extend to web browsing, library visits and stock trading 4j. 

A related measure concerns the rate of activity following a shock, a perturbation, an 

announcement and so on, that impacts a given social system. For instance, measures of 

media coverage after a large geo-political event decay approximately as a power law of time 

since the occurrence of the event 15 1. The rate of downloads of papers from a website after 

fl Q 

a media coverage also follows a power law decay |6j, [7J. The rate of book sales following 
an advertisement or large media exposition decays as a power law of the time since that 
event 0, ij. The rate of video views on YouTube decays also as a power law after peaks 



argues that many other systems are described 



associated with media exposure [10]. Ref. 
by a similar behavior. 

These two measures of human reactions, (i) distribution of waiting times between trig- 
gering factor and response and (ii) rate of activity in response to a "shock", are related. 
This fact has been exemplified by the rate of donations following the tsunami that occurred 
on December 26, 2004 [12j. A donation associated with this event can be considered as 
a task that was triggered (but not necessarily executed) on that day simultaneously for 
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a large population of potential donors. This task competes with many others associated 
with the jobs, private lives and other activities of each individual in the entire population. 
The social experiment provided by the tsunami illustrates a general class of experiments in 
which the same "singular task" is presented at approximately the same time to all potential 
actors (here the donors), but the priority value of this singular task can be expected to be 
widely distributed among different individuals. Since the singular task has been initiated 
at nearly the same time for all individuals, the activity (number of donations) at a time 
t after this initiation time is then simply equal to iV x pdf(t), where N is the number of 
individuals who will eventually act (donate) in the population and pdf(t) is the previously 
defined distribution of waiting times until a task is executed. 

These observations have been rationalized by priority queuing models that describe how 
the flow of tasks falling on (and/or self-created by) humans are executed using priority 
ranking j^, [3, Q]. Assuming that the average rate A of task arrivals is larger than the average 
rate p, for executing them, and using a standard stochastic queuing model wherein tasks 
are selected for execution on the basis of random continuous priority values, Grinstein and 
Linsker derived the exact overall probability per unit time, pdf(t), that a given task sits in 



the queue for a time t before being executed 



13|: 



pdf(*)~^, for/i<A. (1) 
Grinstein and Linsker showed that the distribution (fj]) is independent of the specific shape of 



the exponent p = 3/2 
and with most but 



the distribution of priority values among individuals [14]. The value o 

nn 

is compatible with previously reported numerical simulations |2J, [3J, 
not all of the empirical data. 

Our present theoretical study can be considered both as a pedagogical simplification 
and an extension of Grinstein and Linsker [l4, [14J], with the goal of exploring different 
mechanisms explaining the deviations of the exponent p for its canonical value 3/2. In 
particular, we reveal below the fundamental statistical origin of the power law (TjQ) with 
exponent 3/2 as nothing but a first-passage problem of an underlying random walk jlol ]. 
The initial motivation for the present study came from recent quantitative empirical studies 



3, 



13, 



19] on the time decay of the use of outdated browsers on the Internet and of the 



remaining detectable and surprisingly significant activity on the Internet of the Blaster worm 
since 2003 up to present [if], Q. The activity in these systems reveals the equivalent of 
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the survival distribution of browsers or of computers which have not yet been updated or 
patched (in the language of priority tasks, this is the fraction of all entities which have not 
yet accomplished the task). These activities are found to decay as ~ l/t a , with a = p — 1, 
where the exponent is one unit less than for pdf(t) since it describes the decay of the 
complementary cumulative (also known as the survival) distribution of entities that have 
not yet acted. The new element is that a is found different from 1/2, sometimes smaller, 
while it is larger in other cases. Here, we ask what could be the simplest explanations for 
such behaviors. 

The structure of the paper is the following. Section HT1 presents the model of a target task 
which has the lowest priority compared with all other tasks that flow on the "shoulders" 
(or computer) of an individual. The distribution of waiting time T till the completion of 
the target task is formulated as a first-passage time of an approximately equivalent Wiener 
process with drift. There is a control parameter, that we call the "time deficit" parameter 
(3, which is proportional to the drift of the associated Wiener process. It is proportional to 
the difference between the average time (r]) to complete a non-target task and the average 
time interval (r) between non-target task arrivals: (3 oc (rj) — (r). For small /3's, the 
probability density function (pdf) q(t) of T has a power law tail l/t 1+a with exponent a = 
1/2. Its corresponding complementary cumulative distribution Q(t) exhibits a bifurcation 
as a function of (3. For (3 < but close to 0, Q(t) ~ l/t a and tends to zero at long time. For 
(3 > 0, Q(t) ~ Qoo + C/t a , where Qoo is the non-zero probability that the target task is never 
completed. Section [Till extends the preceding analysis to a population of individuals with 
different time deficit parameters (3. We distinguish between regular distributions around 
(3 = and non-regular ones. For the former, the exponent a is changed into the value 
1 by the effect of heterogeneity. For distributions of (3 that allow for positive values, the 
survival distribution Q(t) exhibits again a non-zero asymptotic limit at large times. For 
non- regular distributions of (3, the exponent a is found to be continuously tunable from 1/2 
to +oo. Section HVl introduces the mechanism of "procrastination", defined as the situation 
in which the target task may be postponed or delayed even after the individual has solved 
all other pending tasks. In the limit where the procrastination inclination is large and the 
time deficit parameter is close to zero, we find that the pdf q(t) of T exhibits a new much 
slower power law tail ~ l/t 1-Q , with a = 1/2 in the regular case. The survival distribution 
Q(t) is characterized by a slow cross-over to the asymptotic power law ~ l/t a . Section IVl 
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concludes. 



II. MODEL AND STANDARD SOLUTIONS 

A. Formulation in terms of a specific target task with the lowest priority 

Let us assume for definiteness, but without loss of generality, that the target task is 
identified at the origin of time t = 0. For concreteness, we will frame our discussion by using 
the examples of the task of updating your browser version on your computer to the newly 
available version. Another example is the task of patching one of your softwares, after its 
vulnerability has been disclosed and its patch has been made freely available. Our goal is 
to derive the distribution of waiting times or, equivalently, the dependence with time of the 
fraction of the population that has not yet performed the task. 

Starting with the classical theory of prioritized queue, we assume that the target task 
has the lowest priority among all other user's tasks. In other words, the users consider 
updating their browser or patching their softwares as doable only after all their other tasks 
have been addressed. This captures the casual empirical observation that computer users 
are often reluctant to interrupt their work, social chatting and blogging, games and other 
activities on their computer for an update or patch that often requires a complete shutdown 
and restart. 

The time at which the target task is performed is denoted T: it corresponds to the time 
interval over which the user has been busy doing other things. We therefore refer to it as the 
"busy time duration." By definition of T, for any t 6 (0,T), there are still other unsolved 
tasks that requires the attention of the individual, while at the instant t = T, all tasks that 
arose earlier have been solved. In the present section, we assume that, once freed of other 
preoccupations at time T, the individual who has been presented with the target task at 
t = will finally perform it immediately. In section IIVI we investigate another situation in 
which, once free of other constraints, the user nevertheless procrastinates. Then, new tasks 
may appear in the meantime, leading to further delays in the completion of the target task. 
This procrastination mechanism leads to new slower decay laws and interesting cross-over 
regimes. But, with the present assumption that the target task is addressed as soon as 
the user is free of other tasks, we obtain that the complementary cumulative distribution 
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function Q(t) of waiting times till the update of the browser coincides with the probability 
that the busy time duration is larger than the given instant t: 



Q(t) = Pr{T > t} . (2) 

We now analyze in detail the components contributing to the busy time duration T. 
Consider first all the tasks which were present before t — when the new target task was 
first presented to the individual, and which have not yet been completed. Let us assume 
that a time 770 is still need after t — to solve these tasks. Then, in the scenario in which 
no new task occur, we have 

Q(t) = Pr{ m > t} . (3) 

We consider now all the other scenarios in which new tasks may fall on the shoulders of the 
individual after t = 0. Specifically, let us assume that the number of such new tasks grow 
with t according to some staircase function n(t), which increases by one unit at the discrete 
occurrence times 

< 7\ < T 2 • • • < T n{t) < t . (4) 

We denote by rjk the time needed by the individual to solve the fc-th task. Then, by definition 
of the busy time duration T, Q(t) is given by the probability of the chain of events C\k=o ^fe> 



Q(t) = Pr{T > t} = Pr 

where 



n(t) 

n a 

k=0 



(5) 



A k = r] + W{k) - T k+1 > (6) 

and 

k 

w{k) = Y,m, w(o) = o. (7) 

i=l 

In ([6]), it is understood that T n ^ + i = t 

Expression ([51) shows that the chain flfc=o °f events At defined by (J6j) determines Q(t) 
completely. It is thus important to have a detailed understanding of it. First, the event 
with index is nothing but 

A = Vo>T 1 , (8) 

which corresponds to the scenario in which the time r] that the individual needs to solve 
all tasks stored up to t = is larger than the time 7\ at which the first new task appears 



after the time t = at which the target task has been assigned. Since the individual is still 
solving other tasks, she cannot perform the target tasks before the arrival of the first new 
task at T\. The event 

A 1 = r]o + Vi> T 2 (9) 

represents the scenario in which the individual is still solving the tasks that were not yet 
finished before t = or the new task that appeared at time T\ when the second task occurs 
at time T 2 . Again, the individual is busy until time T 2 and cannot address the target task. 
The set of events Ak for all fc's up to n(t) follow the same structure, so that the individual 
has still had not time to address the target task at time t < T . Figure 1 illustrates this 
chain flfc=o -A-k of events. 

1o ^2 nn((, 




Fig. 1: Schematic illustration of the chain of events Dk^A-k, leading to the occur- 
rence of T > t, where T is the "busy time duration" until the target task is performed. 



We now introduce the auxiliary stochastic process 



v{k) = j>* - r k ) 



1/(0) = 



1=1 



where 



T k — T k+ i — T k 



(10) 



(11) 



is the time interval separating the occurrence of the fc-th and (k + l)-th tasks arising after 
t = 0. It is also convenient to define 



Vo = Vo- Ti 



i • 



(12) 



as the time missing to complete all tasks stored up to t = when the first new task occurs 
at time 7\ > 0. The case r]' < is excluded as it would correspond to the completion 
of the target task at time t]q before the arrival of the first task at time T\. In this case, 
all subsequent tasks become irrelevant. We also assume for simplicity that r]' Q is a fixed 
deterministic value (we will relax this condition later on), so that 



Pr 



n(t) 

n a 

fc=0 



Pr{V(k) > -T]' } , for any k G (0,n(t)) . 



(13) 
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The sought complementary distribution Q(t) defined by expression (J5J) is therefore given by 

Q(t) = Pi{V(k) > —7]' : k e (0, n(t))} . (14) 

Figure 2 shows a typical realization of the stochastic process V(k) defined by expression 
(HOP over a time interval in which the inequality V(k) > —T]' defining Q(t) in (1141) holds. 
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k 



Fig. 2: Typical realization of the stochastic process V(k) defined by expression (TlOl) 
as a function of the discrete argument k indexing the successive tasks appearing after 
t = at which the target task has been initiated. This realization shown here obeys 
the inequality V(k) > — r]' defining Q{t) in (|14|) over the whole time interval shown. 

B. Approximation in terms of a Wiener process (random walk) with drift 

We assume that the two sequences {%} and {r^} are made of i.i.d random numbers, with 
mean and variance respectively equal to (77), and (r), of. 

In order to obtain the asymptotical properties of Q(t) given by (fT4|) at large times t 3> (r), 
we apply the Law of Large Numbers (LLN) that justifies replacing in (fT4"j) the random number 
n(t) by its mean, 

n(t)^(n(t)) = = t/(r) , (15) 
leading to the following asymptotically exact expression for Q(t), 

Q(t) w Pi{V(k) > —i]' : k E [0,9]} . (16) 

8 



For large times, and thus large #'s, the Central Limit Theorem (CLT) ensures that the 
process V(k) defined by ffTOl) can be interpreted as the discrete version of a Wiener process 
with drift, with the following mean value and variance 

(V(k)) = k ■ ((n) - (r» , Var[K(*)] = a 2 ■ k , a 2 = o\ + a 2 . (17) 

Moreover, for S > 1, we can replace V(k) by its continuous limit, in the form of the 
standard Wiener process with drift, depending on the continuous argument k. Accordingly, 
the probability density function (pdf) f(v; k) of the continuous stochastic process V(k) 
satisfies the diffusion equation 

df(v,k) dfi^k) a 2 d 2 f(v;k) 

supplemented by the initial condition 

f(k;0) = 5(v) . (19) 

Recall that f(v; k)dv is the probability to find V(k) between v and v + dv at "time" k. 

From the theory of Wiener processes, and within the asymptotically exact continuous 
limit just described, it follows that the probability given (Tl6l) is given by 

/oo 
f(v;6\ v ' )dv , (20) 

where f(v; k\rj' Q ) is the solution of the diffusion equation fj 1 8p satisfying the initial condition 
( TT9|) and the additional absorbing boundary condition 

f{v-kW Q )\ =0. (21) 

The solution of the initial-boundary problem ffl8|) . ffT9l . (j2T]l is 

f( v .k\ V ' )=g(v-((r ] )-(T))k;k)- 



cxd ( - 2{{V) 2 {T)Wo ) g(v - ((rj) - (r))k + 2 V >; k) 



(22) 



a' 



where 



1 / v 2 



3M) ^vS eXP U • (23) 



Substituting expression ff22l) into ff20l yields 



with the following notations 



* , = M^>, , « (25) 
<T V a (r) 



The corresponding pdf is 



dQ(t) i 7 / {se + if \ 
q{t) = -— = «71^ exp I ^~ J ' (26) 



In the analysis that follows, the key role played by the parameter 5 defined in ([25]) warrants 
further interpretation. Let us assume that the occurrence of new tasks at the times {T^} 
defined by t^j is a Poisson flow with rate A. Similarly, we assume that the completion of 
tasks at the times {W(k)} defined by (J7J) is also a Poissonian flow with rate \i. We thus 
have 

(r) = a T = \, (rj) = a ri = - . (27) 

(t) is the mean time between arriving tasks and (rj) is the mean completion time of the 
tasks. Accordingly, the parameter 5 is equal to 

5 = j , where e — A (rj) — — . (28) 

Ve 2 + 1 H 

When 

( T ) = (rf) 5 = => e = 1 , (29) 

the rate of new task arrivals is equal to the rate of solving them. This balanced condition 
corresponds to a zero drift in the associated Wiener process, and plays a crucial role in the 
generation of power laws in the distribution of waiting times. When positive, the parameter 
5 quantifies the "time deficit" that is missing on average per task in order for the individual 
to finally be able to complete the target task. For a negative time deficit (5 < 0), the target 
task is almost surely performed in finite time, as we show below. 

It is instructive to analyze separately the complementary cumulative distribution Q(t) 
given by (1241) and its corresponding pdf q(t) given by (126]) of the waiting time for the target 
task to be done (browser upgrade or software patched). We will discuss different situations 
in which Q(t) and q(t) are described by asymptotic power laws 

Q(t) ~ Qoo + 0~ a q(t) ~ Q- a ~ x , (30) 

where may be non-zero in some interesting cases to be discussed below. 
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C. Derivation of the power law pdf q(t) of waiting times till the completion of the 
target task 



We first rewrite the pdf q(t) given by (1261) in a form more convenient for its analysis. For 

this, we notice that q(t) is controlled by two characteristic scales 

7 2 2 
6 1 = — , 6 S = • ( 31 ) 

For definiteness, consistent with our previous assumption that r]' is constant, we take 7 (i.e. 
7 ) to be constant. We then explore the behavior of q(t) for different values of 5 (i.e. 6s). It 
is convenient to introduce the new variable 



6 26 



and the parameter 



The dimensionless pdf 



takes the form 



P=ff 5 =\^- (33) 
n(p;(3) = ^-^(r)q(t) (34) 

^ = ^ (_C&±1£) . (35) 

The dependence of n(p\ /3) as a function of p is qualitatively different for (3 <C 1 (0 7 <C #5) 
and for /3 > 1 (0 7 > 6s). For /3 > 1, n(p; [3) does not exhibit any power law asymptotic, not 
even in an intermediate domain of p. In contrast, for 

(36) 

the pdf /c(p; /3) possesses the intermediate power asymptotic 

k( P] (3) « p^ 2 , 1 < p < /T 2 , (37) 

which is replaced, for larger p, by the exponential decay 

k( P ; (3) « p^V^" . (38) 

Note that the function n(p; f3) is nothing but the pdf of first return to the absorbing boundary 
condition defined in (j2ip (the target task is performed) of the Wiener process with drift with 
the characteristics ffTTl) 11511. In the balanced case ([29]) 



(3~ 2 = 00 
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the power law (137|) holds for any p > 1. 

Figure 3 illustrates the cross-over of n(p; f3) from the intermediate power law asymp- 
totic (13T|) and the exponential tail (13"5j) for three increasing values of the normalized drift 
parameter f3. 




10 1 10° 10 1 10 2 10 3 10 4 

p 



Fig. 3: Dependence of n{p\ (3) given by (|35p as a function of p for three values of the 
normalized drift parameter = 0.01;0.05;0.1. The dashed straight line correspond to 
the pure power law ~ p~ 3 ^ 2 . The larger f3 is, the narrower is interval in p for which 
the intermediate power asymptotic holds. 

D. Derivation of the survival distribution Q{t) of waiting times till the completion 
of the target task 

The complementary cumulative distribution Q(t) given by ff24|) can be rewritten in the 
following form, which is more convenient for the analysis of its asymptotic behavior: 

g(() , fi(p; ,)4( 1+erf (i±|£)-^e rf c(i^)). ,39, 

This allows us to show that the function Q(p; j3) has a qualitatively different behavior at 
p —>■ oo for (3 > and for (3 < 0. This can be seen from the corresponding limits of Q(p; /3): 

\Qoo(P) , P > , 

hm<2(p;/3)= Q 00 ((3) = l-e- 4 ^ . (40) 

P ^°° [o , p < . 
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For (3 > 0, Q(t) tends to a strictly positive limit Qoo{P) > as p -> +oo. For (3 < 0, 
Q(p; /3) tends to zero as p — > +oo. These two limits have the following simple probabilistic 
interpretations. For (3 > 0, (77) > (r): the average time needed to complete a task is larger 
than the average inter-arrival times between new incoming tasks. As a consequence, there 
is strictly positive probability Qoo(P) > that the target task will never be completed. In 
the language of the drifting Wiener process with characteristics (fT7|) . in the presence of a 
positive drift, there is a finite probability Qoc(fl) for the Wiener process to escape to infinity 
(the target task is never completed), and a probability 1 — Qoo{P) f° r being captured at 
the absorbing boundary defined in ( l2Ti) (the target task is performed). In contrast, for 
(3 < 0, (77) < (r), and the individual will almost surely complete the target task (update 
her browser or patch her software) in finite time. Thus, for (3 < 0, Qao(fl) = 0. Crossing 
the value f3 = is analogous to a phase transition or bifurcation characterized by the order 
parameter Q CQ {(3) varying as a function of the control parameter (3: for (3 < 0, the order 
parameter is zero and it bifurcates to a non-zero value for (3 > 0. 

Equation (1391) expressed for the balanced case (3 = yields the following power law 
asymptotic 

Q(p;0)«-^=, p>l, (41) 



corresponding to power law exponent a = 1/2, as defined in (l3"Uj) . For |/3| > 1, <2(p; (3) 
does not exhibit any power law asymptotic, not even in an intermediate domain of p. For 
\(3 1 <C 1, the power law fT4T]) holds as an intermediate asymptotic in the interval 

1<P<P~ 2 - (42) 

For p ^> /9~ 2 , the intermediate power law asymptotic fj4Tl) crosses over to an exponential 
decay converging to for (3 < or to Qoo(P) > given in fT40l) for /? > 0: 

Figure 4 plots the dependence of <2(p; /3) as a function of p for, /3 = ±0.001, which illustrates 
the qualitatively different behavior of <2(p; (3) for (3 > and for /3 < 0. 
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Fig. 4: Complementary cumulative distribution Q(p; (3) for (3 = ±10 -3 as a function 
of the normalized waiting time p for completing the target task, demonstrating the 
qualitatively different asymptotic behavior of Q(p; (3) for (3 > and for < 0. The 
dashed straight line shows the power asymptotic (|4ip corresponding to the balanced 
case (3 = 0. 



III. DISTRIBUTIONS OF THE TIME DEFICIT PARAMETER LEADING TO 
DIFFERENT POWER LAW EXPONENTS AND REGIMES 

A. Regular distribution of the normalized time deficit parameter (3 around the 
origin 

1. Qualitative justification of the form of the normalized time deficit parameter [3 

As shown in the previous section, the proximity to the balance condition ( 1291) is essential 
for the power laws ( 1371) or ( 14T1) to hold over an intermediate asymptotic region sufficiently 
large to be observable (at least over one to two decades in time). 

In the present theory, the time deficit parameter 5 or equivalently its normalized version (5 
are exogenously given. In reality, 5 {(3) embodies the interplay between the subtle processes 
of task formation, of prioritization and the efforts undertaken to solve them, that each 
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individual adjusts continuously. We conjecture that users of browsers and of softwares adapt 
approximately but not exactly, of course, to the balance condition ff29l) . This is done for 
instance by being selective among the flow of tasks (by deleting needless incoming emails or 
ignoring some superfluous tasks), and/or by adapting the time allocated to solving tasks, so 
that the mean time (rj) needed to solve a given problem is approximately equal to the mean 
time interval (r) between subsequent arriving tasks. If this was not the case, the individual 
would in general adjust the allocation of her time. Suppose for instance that (rj) > (t). 
In this case, the individual is not able to face the flow of incoming tasks and a boundless 
number of tasks piles up, suggesting a non-sustainable regime either for the computer or its 
user. In the opposite case (77) < (r), the individual sits idle a significant fraction of her time. 
By enlarging the definition of what is meant by "task" to include other activities, including 
the recreational activities that arguably constitute a significant part of the utility or pleasure 
driving individuals, it is clear that the case (77) < (r) is not realistic as a sustained regime. 
We also conjecture that the adjustment process leading to the convergence of (r) towards (77) 
and vice-versa may describe the general problem of task flow versus their solutions, beyond 
the specific problem of browser update and software patching discussed here, to encompass 
the general balance of human activities. We thus believe that the results presented here are 
of broader interest and may help understand the general statistical properties of the time 
allocation of humans. 

Following these arguments, while individuals can be expected to adjust towards the bal- 
ance condition ( 1291) . it is unlikely that all humans will do so accurately. Indeed, in many sys- 
tems subjected to noise in which state-dependent control actions are performed, the control 



parameter never settles but continues to fluctuate around the target parameter |2Q|,l2l|,l22|. 
Therefore, we propose that the population of browser and software users can be described 
by a distribution of time deficit parameters 5 (/3) which is centered on 0. We thus take 
into account the mentioned fluctuations by considering 5 as random variable with some pdf 
4>(S). The idealized balance condition 5 = is replaced by the more realistic mean balance 
condition: 

/oo 
5<P(5)d5 = . (44) 
-00 

Assuming that 7 is some deterministic constant as done above, then the mean balance 
condition (1441) is equivalent to 

{(3) = . (45) 
15 



2. Derivation of the pdf q{t) for a Gaussian distribution of the normalized time deficit param- 
eter (3 



In order to obtain concrete quantitative predictions, let us first consider that the pdf 
^{(3) of the random variable (3 is a Gaussian law centered on and with standard deviation 
A>: 

m ft) = -jL- exp (-^) . (46) 



The idealized balance condition (3 = is recovered in the limit (3$ — > for which the Gaussian 
pdf (1461) tends to the Dirac function ip(/3) = 5(j3). With the choice ()46l) . the mean balance 
condition ( )45i) holds by construction. 

Amazingly, it turns out that the fluctuations of the parameter (3, that satisfy the mean 
balance condition ((3) = 0, drastically change the asymptotic form of the distribution Q(t) 
and its associated pdf q(t), compared with the idealized case in which the balance condition 
f3 = holds exactly for each individual at all times. Let us first analyze this effect on the pdf 
of waiting times for the completion of the target task, which is now given, in its normalized 
version by the weighted average with respect to (3 of «(p; (3) given by (1351) : 



R(p)= K(p;P)i/>(P)dl3. (47) 

J — oo 

Using the Gaussian distribution (l4"6{) . this yields 

R(p; (3 ) = ; 1 exp ( 2I3 \ 9 - 1 , (48) 



which has the following asymptotic behavior 

R(p; (3 ) « -ir- ~ 2 ~ P~ 2 , P > /3 " 2 • (49) 
V^vrpo P 

Thus, the exponent a, defined in f l30|) . changes from the value a = 1/2 for the idealized 
balance condition into a = 1 in the presence of fluctuations of the time deficit parameter 
from individual to individual and/or as a function of time. The mechanism acting here 



is similar to the mechanism of "sweeping of an instability" 23|], since the presence of a 
distribution of time deficit parameters around the balance condition (3 = indeed amounts 
to sweeping the control parameter (3 over its bifurcation point defined in subsection III D[ We 
stress that this "renormalization" of the exponent a from the value 1/2 to 1 is not sensitive 
to the details of the shape (j4"6j) of the distribution of the time deficit parameter (3. The 
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single essential feature is that if)(f3; (3q) goes to a constant for (3 — > 0. For any distribution 
Po) having this property of going to a non-zero constant as (3 — > 0, the asymptotic tail 
032]) holds. We will discuss in subsection IIIIBI variations to this conditions and derive the 
corresponding changes in the exponent a. 

For the specific form (j46l) . the above asymptotic result (I49p can be made more accurate 
as follows. For (3q > 1, R(p; (3) presents the unique power law regime (1491) with exponent 
a — 1. For (3q <C 1, there is an additional intermediate power law asymptotic with exponent 
a = 1/2 in the interval (analogous to (1421) ) 



which is followed beyond the crossover point p* ~ (3q 2 by the power law (T4"9l) with exponent 
a = 1. Figure 5 shows the dependence of R(p; (3) as a function of p given by expression (J3ED, 
for three different values of (3 that illustrate the intermediate asymptotic with a = 1/2 and 
the tail asymptotic with a = 1. 



i < p < A 



.-2 
'0 



(50) 




10° 



p 



Fig. 5: Dependence of the normalized pdf k(p; (3q) given by expression (|48|) of the 



waiting times until the completion of the target task as a function of the normalized 



time p for [3 = 0.01; 0.1; 1. For the smaller values of (3, one can observe the intermediate 
asymptotic ~ p~ 3 / 2 progressively crossing over to the tail asymptotic ~ p~ 2 . 
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3. Derivation of the survival distribution Q(t) for a Gaussian and a semi- Gaussian distribution 
of the normalized time deficit parameter ft 

In the presence of a distribution of time deficit parameters (3, the complementary cumu- 
lative distribution Q(t) of the waiting time until the completion of the target task can be 
written as 




P 



Fig. 6: Dependence of the averaged cumulative distribution Q(p; /?o) as a function of 
the normalized time p. The solid lines correspond to /?o = 0.1; 0.05; 0.01; 0.005; 0.001 
(top to bottom). The dashed straight line is the power law asymptotic (|4ip . 

Figure 6 plots of the dependence of Q(p; (3q) as a function of p for various values of j3o- 
For all non-zero values of /3o, one can observe that the asymptotic tail exhibits an upward 
curvature, leading to a departure from the a priori expected dependence Q(p; fio) ~ P 1 - 
And the larger (3q is, the slower is the decay of Q{p;/3o), which becomes even slower than 
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0.35 




0.1 0.2 0.3 0.4 0.5 



A) 

Fig. 7: Plot of limiting probability Qoo(Po), preventing shaping the power law of 
averaged complementing distribution Q(p; Po) for p — > oo 

The origin of the contradiction between the well-defined asymptotic power law (I49j) for the 
pdf and the behavior shown in figure 6 stems from the existence of the transition occurring 
at Pq = above which Q(p; P) acquires a non-zero limit Qoo(P) given by fj40l at p — ► +oo, as 
explained in subsection III D[ Accordingly, the averaged complementary distribution Q(p; Pq) 
exhibits the strictly positive limit 

lim Q(p; p ) = 0oo(/3o) = / Qoo(P)W\ PoW . (52) 
p-* 00 JO 

For the Gaussian distribution (j46p . this limit is given by 

Qoo(A) = i (l - e 8/3 « erfc(2v / 2 / 9 )) . (53) 

Figure 7 shows the dependence of this limit Qoo(Po) as a function of /?o- In the context 
of browser updating and software patching, this predicts a regime in which an interme- 
diate asymptotic Q(p; Pq) ~ is followed by a slow cross-over to a positive plateau, 
corresponding to a finite fraction of the population that never upgrades or patches. 

In subsection IHI A 1\ we argued that individuals confronted with a flow rate ~ 1/ (r) 
of tasks and the desire to solve them characterized by the average solution time (rj) tend 
to adjust (r) towards (77) and/or vice- versa. Let us here consider the possibility that (r) 
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remains marginally smaller than (77), so that tasks do not accumulate. Taking into account 
the heterogeneity of humans and the variability with time of their strategy, this corresponds 
to changing the Gaussian distribution into the semi-Gaussian distribution 



^-(/3;/3 ) 







'2 1 



p > 
p < 



( >L\ (54) 

nfk^y 2(H) 

The corresponding average complementary cumulative distribution of the waiting times till 
the completion of the target task reads 



Q-{r,Po) = J~ f Q(p;/?)ex P 

V 7T Po J -00 



p 2 



2/?o 2 , 



dp . 



(55) 



Since the contributions of non-zero values of Qoo(A)) are removed by this specification of the 
distribution of P, Q_(p; p ) exhibits a well-defined asymptotic power law ~ Figure 8 
shows the function Q_ (p; Pq) as a function of p for different Po values, and illustrates the 
crossover from the power law ~ p -1 / 2 for p < Pq 2 (condition (1501) ) to ~ p _1 at large times 

P- 



10 



O) 



10" 



10 









" - A> = 0.001 




\ A = 0.01 




A) = 0.1 \ N. 




\ \ \^ 




\ \ \ 
\ \ 











10 



10 



10 



10 



10 



Fig. 8: Dependence of complementary cumulative distribution Q_(p;/?o) given by 
(|55p as a function of the normalized waiting time p till the completion of the target 
task. Top to bottom: Pq = 0.001; 0.01; 0.1. One can observe the crossover from 



P 



x l 2 to ~ p 1 for Pq = 0.01 and the unique power law 



,-1 



for po = 0.1 
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B. Non-regular distribution of the normalized time deficit parameter (3 around 
the origin 



The assumption of a Gaussian or semi-Gaussian pdf (1461) for the distribution of the 
normalized time deficit parameter (3 is representative of the general class of distributions 
which are regular close to the origin, i.e., converge to a non-zero constant for (3 — *■ 0. As we 
showed in the previous subsections, it is the regular behavior around (3 = which controls 
the tail of the pdf and survival distribution of waiting times. It is therefore interesting to 
investigate the consequence of the existence of less or more probable deviations from the 
critical value (3 = 0. For instance, as mentioned in subsection IIII A 11 if individuals adjust 
their time deficit parameter (3 oc (rj) — (r) via state-dependent control actions, we can expect 
deviations from the assumption that the fluctuations of (3 are smooth around (3 = 0. We 
capture such possibility by considering the following asymptotic behavior of the distribution 



ij_((3;(3 \v)^A(p ,v)\f3W -1< v < +oo , /3 -»• -0 , (A < oo) . (56) 



The case v = recovers the regime of subsection IIII Al In order to remove the impact of the 
super-critical domain (3 > on the survival distribution, we also assume here that all /3's 
are negative. It is a simple matter to remove this condition and recover a regime of non-zero 
asymptotic behavior for the survival distribution, as discussed in subsection IIII A3! 

The corresponding pdf of the waiting time till completion of the target task is given by 



In order to determine its asymptotic behavior for large p's, it is sufficient to replace this 
expression by 



of /3's 




(57) 




(58) 



which, using (1381) . yields 





'OO 



(59) 



Similarly, the survival distribution of waiting times is given by 




a(v) 



(60) 
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Since v e (—1, oo) for i/j-(/3; /3o\v) to be normalized, formula (|56|) shows that the exponent 
a can only take values in the interval 

a e (-,oo) . (61) 



For instance, Diibendorfer et al. 



19| report a value a = 2/3 for the decay of the fraction 



of computers that still keep an outdated Firefox 2 browser. Within the present framework, 
this corresponds to v = —2/3, i.e., to a significantly stronger concentration of j3 close to 
than would be found expected from a semi-Gaussian distribution for instance. 



IV. THEORETICAL FORMULATION OF THE IMPACT OF PROCRASTINA- 
TION: NEW POWER LAW REGIMES 

In the previous sections, we have assumed that, as soon as all other tasks are solved, the 
individual addresses without delay the target task with the lowest priority that now comes 
to the front. In the present section, we explore the consequences of the different possibility 
that procrastination kicks in, so that the target task is postponed and delayed needlessly 
due to carelessness or laziness, or for whatever other reason. 



A. Model and mathematical solution 



Consider the flow of new tasks occurring at the times given by (J4j) and the process V(k) 
defined in ffTUj) . Let us denote by t n the times when V(k) touches the value — rj' from above 
(see figure 2), at which the individual is freed from all tasks except the final target task. The 
set {t n } are the beginnings of the time intervals in which the individual is free to address 
the target task. Let N(t) be the random number of such free moments in the time interval 
(0, t) and let us call 

P(n;t) = Pr{iV(t) = n} (62) 

the probability that the number of spare times in (0, t) is exactly equal to n. We assume 
that the individual will procrastinate in such a free moment with probability < z < 1. 
This is the probability for not upgrading your browser or non patching your software in one 
of your free times. For simplicity, we consider z to be independent of the duration of the 
free time interval. It would not be difficult to consider alternative specifications dependent 
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on the duration for the free time interval but, for most reasonable versions, the main result 
on the power law tails obtained below are not modified. Assuming that procrastination is 
independent in successive free time intervals, the probability that the individual does not 
complete the target task until time t is given by 

oo 

Q(t,z) = Pr{T > t} = ^P(n;£)z n , (63) 

n=0 

where T is the waiting time till the target task is completed. 

In order to calculate Q(t,z) given by ( |63l) . we need the expression of P(n;t). For this, 
we relate it to the probability 

Fit; n) = Pr{t n < t} (64) 

that, for a given n, the random variable t n does not exceed t. The relation between P(n; t) 
and F(t; n) is 

P(n;t) = F(t;n) -F{t;n + 1) (n > 1) , P(0; t) = 1 - F(t; 1) . (65) 

Expression fl65l) writes that the number n of free intervals occurring in (0, t) is determined 
by the condition that the n-th free time interval starts before t while the (n + l)-th free time 
interval starts after t. 

Substituting ( 1651) in relation (1631) yields 



Q(t,z) = l + (z-l)Y,F(t;n)z n ~ 1 . (66) 



n=l 



It is more convenient to work with the pdf of the waiting time till the completion of the 

dt 



target task, defined by g(t, z) = — 9Q ^' Z ^ . From ( 1661) . we obtain 



q(t,z) = (l-z)Y,f(t;n)z n - 1 , (67) 

n=l 

where 

dF(t;n) 

f(^ n ) = — qI — ( 68 ) 

is the pdf of the random variable t n of the beginning of the n-th free time interval. 

As it should, the limit z = in (1671) recovers the pdf given by (}2~4"1) of the waiting time 
till the completion of the target tasks, 

q(t,0) = f{t;l) = f 1 (t) . (69) 
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The beginning time t n of the n-th free interval can be written as the sum of n waiting 
times: 

t n = At 1 + ... + At n . (70) 

The first waiting time At\ is the duration of the time interval starting at the inception 
time t = of the target task until the individual is free to address the target task for the 
first time. We denote its pdf as fi(t). The other terms At 2 ,---,At n quantify the waiting 
times between successive beginnings of free time intervals. They are independent identically 
distributed random variables, with common pdf denoted /(£). The expressions for fi(t) and 
f(t) are made explicit in subsection IIVDI Then, the pdf of the sum (!70|) is equal to the 
n-times convolution 

f{t]n) = f 1 (t)®f(t)®"-®f{t) (71) 

n— 1 times 

The Laplace transform of f(t; n) is thus 

f{s- n) = J o fit- n)e~ st dt = f^s) ■ f n ~\s) , (72) 

where fi{s) and f(s) are respectively the Laplace transforms of fi{t) and f(t). 
Applying the Laplace transform to both sides of equality (!67|) . we obtain 

q(s,z) = (1 - z)f\(s) £ r-\s)z^ = ^"^^ . (73) 

n=l l-zf{Z) 

As shown in subsection IIV D\ the pdf's fi{t) and f(t) have in general the following power 
law asymptotic 

f\{t)~ ai -t- a - 1 , /(ij-a-r"- 1 , t^oo. (74) 
This implies that their Laplace transform have the following asymptotic 

f x (s) ~ l + r(-a) -ai -s a , /(s) ~ l + r(-a) • a • s Q , s^O. (75) 
Substituting these last relations into (173"]) yields the asymptotic form of q(s, z) 

where 

x = - a . r(- a ) • ( X > 0) . (77) 
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B. Expression of the probability z for not completing the target task in one of the 
free times 



Let us denote by (k the duration of the k-th free interval, from the beginning time tk to 
the arrival of the first new task. Note that one cannot interpret (k as the duration of the 
time interval during which the Wiener process V(t) remains below the level — rj' Q , because 
the formulation in terms of a Wiener process has a sense only for V(t) > —T]' . Actually, (k 
has the simple interpretation of being the waiting time counted from any arbitrary time till 
the occurrence of a new task. 

Consider the simple instance in which the tasks arrive according to a Poisson flow with 
rate A, such that the pdf of (k reads 

tt(C) = Ae^ AC . (78) 

One can interpret 1/A as the mean waiting time between task arrivals. For the sake of 
clarity, let us also assume that the probability not to perform the target task during a free 
time is a decreasing exponential function of the duration (k of that free interval: 

P(0 = e- AlC . (79) 

One can interpret 1/Ai as the average "procrastination time." Expression f!79|) assumes that 
the individual decides to perform the target task during one of her free time according to a 
constant probability per unit time, i.e., according to another Poisson process with rate Ai. 
Averaging this probability over the statistics of ( yields the probability z that the target 
task will not be performed during a given free time interval: 

roc roo \ 

z = / P(CMCR = / e- x ^\e-^dz = —— . (80) 
Jo Jo A + Ai 

If Ai <C A, i.e., if the arrival rate of new tasks is significantly larger than the rate with 
which the individual fights her procrastination, then the probability z not to perform the 
target task in a given free interval is close to unity. As we see below, this regime 2; — >■ 1 is 
responsible for a much slower decay of the pdf and survival distribution of the waiting times 
till the completion of the target task. 
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C. Derivation of the distribution of waiting times till the completion of the target 
tasks in the presence of pronounced procrastination (z — > 1) 
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251 ]) that the inverse Laplace transform of (1761) is 



It is well-known (see, for instance, 
equal to 

q(t,z) = ^K^-^,aj , (81) 

where n(y, a) can be expressed as the weighted sum of exponential distributions 

f°° 1 ( y\ 
«) = / - exp £(//, a)dfx , (82) 



with weights 

1 sin(7ra) . „ s 

C m, « = a ^ _ a \_ 9 7 . • 83 

7T/i /i a + /i a + 2 COS(7TQ;) 

The corresponding complementary distribution is 

K.(y,a) = J K,(x,a)dx = J exp ^— — ^ £(/i, a)d// . (84) 
Expression (1821) predicts the existence of the two regimes 

K(y, a) ~ — ^ , for y < 1 ; and n(y, a) ~ — ^ , for j/ > 1 . (85) 
This translates into 

q(t, z) ~ x" 1 — , for t < 5 and g(t, z) ~ x -A_ , for t > X 1/a • (86) 

Figure 9 plots the pdf n(y,a) given by (1821) for a = 0.7 and confirms the existence of an 
intermediate asymptotic power law regime q(t, z) ~ l/t 1 ^ a for t <C x 1 ^, which decays much 
slower than in absence of procrastination (z — leading to a small \ and to g(t, z) ~ l/t 1+Q ). 
The two regimes could be observed in the data by taking the derivative as a function of time 
of the fraction of remaining individuals who have not yet solved the target task. 

For Ai <C A, we can approximate expression ( IBUl) by z = 1 — so that \ given by (1771) 
is approximately equal to x — ~~ a ' r(— a) • ^- ^> 1, i.e., it is proportional to the ratio of the 
average "procrastination time" over the mean waiting time between task arrivals. It is this 
ratio j- that determines the range of the intermediate asymptotic power law q(t, z) ~ l/t 1 "", 
which holds for t < (A/Ai) 1/a . 
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Fig. 9: Plot of the pdf a) given by (|82[) as a function of the reduced variable y = 
t/x 1 /" for a = 0.7. The two dashed lines correspond to the intermediate asymptotic 
and to the tail asymptotic given by (|85p. 



If we look directly at this fraction (JC(y, a) in normalized units) of non-solved target task, 
we do not find two clear power law regimes, but rather a smooth cross-over to the asymptotic 
power law tail JC(y, a) ~ as shown in figure 10. 
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Fig. 10: Dependence of the normalized complementary cumulative distribution 
)C(y, a) given by (|84[) . for a = 0.7, as a function of the normalized time y = t/x l ^ a - 
The dashed straight line corresponds to the asymptotic power law ~ y~ a . The two 
dotted straight lines are power laws ~ y _1 /3-i/4^ ^ Q ^ e e y e SU gg es t the presence 
of an intermediate apparent slow power law decay over some limited time range. 



D. Derivation of the pdf of the waiting times between successive beginnings 
of free time intervals 

Let us now justify the form ( 1741) for the pdf of Ati and of the pdf f(t) of the other 
independent random variables At 2 , At n as denned in ( 1701) . 

As we showed in section [TXJ, the pdf of Ati coincides (in the Wiener process approximation) 
with the pdf for the Wiener process V(t) of first touching the level — r)' : 

This has the form (174")) with a = 1/2, when the time deficit parameter 5 is close to zero. 

The other random variables At2, ...,At n are each the sum of two independent contribu- 
tions, Atk = (k + Atk, where (i) (k is the duration of a free time interval, which has the same 
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distribution as that of the waiting time counted from any arbitrary time till the occurrence of 
a new task and (ii) Atj. is a time similar to At\ for the Wiener process V(t) of first touching 
some new level. In order to specify further the properties of this second contribution Atj., 
we recall that the pdf fi(t) of Ati given by (18"T|) depends on the parameter 7 = ^- defined 
in (1251) . where r]' is the time needed by the individual to solve tasks that has been stored, 
and a n is the standard deviations of the times rjk needed to solve the fc-th task. The pdf 
fit) of Atk can be written as 

poo 

f(t) = / w^Mtl^dj , (88) 
J 

where the notation /i(t|7) makes explicit the dependence on 7 in expression (IHTj) . The 
integral in ([581 is performed over the random variable 7 weighted by its pdf 10(7), which is 
determined as follows. The end of an interval which was free of any task (except the target 
task that remains to be addressed) is triggered by the occurrence of a new task that takes 
priority over the target task, and the pdf of the time needed to solve it is (p(rj), with mean 
and variance equal to (77) and cx^, as defined in subsection III Bl We thus have 

w(j) = a v ip{a v ^) . (89) 

To illustrate, suppose that <p(r}) is exponential, (p(rf) = \ie~ m , leading to a v — 1/fM and 

w ( 7 ) = e -7 . (90) 

Substituting (JHTJ) and ([90]) into flEFJ yields 

It is straightforward to check that this pdf f{t) has the asymptotic power law (1741) with 
a = 1/2. Now, f{t) is the convolution of the pdf 7r(£) and of fit), and its tail is determined 
by that of fit), hence the form (1741) with a = 1/2. 

V. CONCLUDING REMARKS 

We have developed a simple general framework to model the distribution of waiting times 
between the triggering factor and the actual realization of a job, for the particular tasks that 
are both important (sometimes even essential) but are often considered low priority because 
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they require interrupting the normal flow of work or life. Beyond the examples of Internet 
browser updates and software vulnerability patching which initially motivated our interest 
in this question, we suggest that our theory can apply to less quantifiable but equally 
important questions such as the delay in implementing important decisions in one's life. 
While we recognize of course the existence of additional important psychological factors and 
social influences, our approach provides a simple parsimonious starting point of a general 
theory of procrastination. 

Acknowledgements: We are grateful to R. Crane and T. Maillart for stimulating dis- 
cussions. 
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